Regularization for Cox's Proportional Hazards Model with Np-dimensionality.
نویسندگان
چکیده
High throughput genetic sequencing arrays with thousands of measurements per sample and a great amount of related censored clinical data have increased demanding need for better measurement specific model selection. In this paper we establish strong oracle properties of non-concave penalized methods for non-polynomial (NP) dimensional data with censoring in the framework of Cox's proportional hazards model. A class of folded-concave penalties are employed and both LASSO and SCAD are discussed specifically. We unveil the question under which dimensionality and correlation restrictions can an oracle estimator be constructed and grasped. It is demonstrated that non-concave penalties lead to significant reduction of the "irrepresentable condition" needed for LASSO model selection consistency. The large deviation result for martingales, bearing interests of its own, is developed for characterizing the strong oracle property. Moreover, the non-concave regularized estimator, is shown to achieve asymptotically the information bound of the oracle estimator. A coordinate-wise algorithm is developed for finding the grid of solution paths for penalized hazard regression problems, and its performance is evaluated on simulated and gene association study examples.
منابع مشابه
Novel Harmonic Regularization Approach for Variable Selection in Cox's Proportional Hazards Model
Variable selection is an important issue in regression and a number of variable selection methods have been proposed involving nonconvex penalty functions. In this paper, we investigate a novel harmonic regularization method, which can approximate nonconvex Lq (1/2 < q < 1) regularizations, to select key risk factors in the Cox's proportional hazards model using microarray gene expression data...
متن کاملRegularization Paths for Cox's Proportional Hazards Model via Coordinate Descent.
We introduce a pathwise algorithm for the Cox proportional hazards model, regularized by convex combinations of ℓ1 and ℓ2 penalties (elastic net). Our algorithm fits via cyclical coordinate descent, and employs warm starts to find a solution along a regularization path. We demonstrate the efficacy of our algorithm on real and simulated data sets, and find considerable speedup between our algori...
متن کاملA cocktail algorithm for solving the elastic net penalized Cox's regression in high dimensions
We introduce a cocktail algorithm, a good mixture of coordinate decent, the majorization-minimization principle and the strong rule, for computing the solution paths of the elastic net penalized Cox’s proportional hazards model. The cocktail algorithm enjoys a proven convergence property. We have implemented the cocktail algorithm in an R package fastcox. Numerical examples show that cocktail i...
متن کاملA novel L1/2 regularization shooting method for Cox's proportional hazards model
Nowadays, a series of methods are based on a L1 penalty to solve the variable selection problem for a Cox’s proportional hazards model. In 2010, Xu et al. have proposed a L1/2 regularization and proved that the L1/2 penalty is sparser than the L1 penalty in linear regression models. In this paper, we propose a novel shooting method for the L1/2 regularization and apply it on the Cox model for v...
متن کاملSheppard's correction for grouping in Cox's proportional hazards model
Cox's proportional hazards model is often t to grouped survival data, i.e. occurrence/exposure data over given time intervals and covariate strata. We derive a Sheppard correction for the bias in the grouped data analogue of Cox's maximum partial likelihood estimator. This is done via a large sample theory in which the covariate strata and time intervals shrink as the sample size increases.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Annals of statistics
دوره 39 6 شماره
صفحات -
تاریخ انتشار 2011